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PRIMER-SPECIFIC AND MISPAIR EXTENSION ASSAY FOR IDENTIFYING GENE 

VARIATION 

CROSS-REFERENCE TO RELATED APPLICATIONS 
[0001] This application claims priority to, and is a continuation of, International 
Application No. PCT/CA99/00733, filed on 9 August 1999, designating the United States of 
America, the contents of which are incorporated by this reference, the PCT International Patent 
Application itself claiming priority from the Canadian Application Serial No. 2,245,039 filed 13 
August 1998. 

BACKGROUND OF THE INVENTION 
[0002] Field of the Invention: The invention relates to primer-specific and mispair 
extension assay for identifying gene variations, such as in different genotypes or subtypes of a given 
genotype. 

[0003] State of the Art: Current genotyping systems are technically complex, time- 
consuming and error-prone in the detection of a single nucleotide variation and low level 
heterozygotes. 

[0004] Despite current genotyping systems, such as restriction fragment length 
polymorphism analysis (RFLP), hybridization for example, line probe assay ("LiPA"), selective 
DNA amplification by PCR-type-specific primers (Okamoto, H., et al., J. Gen. Virol 73: 673, 678, 
1992) and direct DNA sequencing having been useful in general, some technical problems still 
remain and limit their applications. 

[0005] Development of viral resistance to antiviral drugs used for treatment of HIV- 1 
infection is an important cause of treatment failure (Coffin JM., Sciences, 267: 483-489, 1995). In 
addition, drug-resistant mutations can give rise to selective cross-resistance to other antiviral drugs 
which has limited the options available for alternative antiviral regimens (Cohen OJ, and Fauci AS., 
N. Engl J. Med, 339: 341-243, 1998). Thus, genotypic drug resistance testing plays an important 
role in selecting an initial antiviral regimen and changing therapy with alternate regimens as the 
need arises. However, current genotyping assays, including direct DNA sequencing, yield 
information on only the predominant viral quasispecies due to the inability of these assays to detect 
low levels of viral variants (Gunthard, H.F., et al, AIDS Res. Hum. Retroviruses, 14: 869-876, 
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1998). An ABI automated sequencer was able to detect mutants at levels of 10-50% in an artificially 
mixed sample only when the mixes were analyzed by editing the sequences manually. 
Hybridization-based high-density oligonucleotide arrays (GeneChip) (Deeks SG, Abrams DL, 
Lancet, 349: 1489-1490, 1997) were less sensitive than the ABI and were able to detect mutations at 
levels of 25-75% in the mix (Giinthard, H.F., et aL, AIDS Res. Hum. Retroviruses, 14: 869-876, 
1998). Similar results were obtained using an automated DNA sequencer (Visible Genetics) (Hu 
YW, et aL, Reliable detection of mixed HCV genotype infections using a novel genotyping assay. 
5 th International Meeting on Viral Hepatitis C Virus and Related Viruses, Molecular Virology and 
Pathogenesis, Venezia, Italy, 1998). Line Probe Assay ("LiPA"), which uses reverse hybridization 
technology, is relatively rapid and could detect mutants at levels as low as 5% (Stuyver L, et aL, 
Antimicrob. Agents Chemother., 41: 284-291, 1997), but again, may not be suitable for detection of 
mixed genotypes because it may not give results due to nearby polymorphisms that impair 
hybridization (Stuyver L, et aL, Antimicrob. Agents Chemother., 41: 284-291, 1997). Moreover, in 
40% of the samples tested, LiPA failed to yield correct results for some of the drug-resistant 
mutations (Puchhammer-Stockl E, et aL, J. Med. Virol., 57: 283-289, 1999). Population based 
sequencing, for example, cloning and sequencing, is the gold standard method for detection of 
minor drug-resistant mutants. Unfortunately, it is impractical for clinics and large cohort studies. 

[0006] For most indirect DNA sequencing genotyping systems, a common weakness is 
that they are not as accurate as direct DNA sequencing analysis, particularly for detection of a single 
nucleotide mutation or variation, resulting in considerable instances of errors or inconsistent results 
(Andonov, A., et aL, J. Clin. Microbiol. 32: 2031-2034, 1994; Tuveri, R., et al., Journal of Medical 
Virology 51: 36-41, 1997; Okamoto, H., et aL, J. Virol. Methods 57: 31 45 002-16, 1996). 
Although direct DNA sequencing is the most reliable method for genotyping, this is not practical for 
large cohort studies. 

[0007] For example, International Publication No. WO 91/13075 describes a method for 
detecting variable nucleotides based on primer extension and incorporation of detectable nucleoside 
triphosphate using T7 polymerase for extending the primer. However, T7 polymerase does not 
have the proofreading activity and the 3 '-5' exonuclease activity of pfu results in false positive or 
false negative reactions. Moreover, the method uses ddNTPs for terminating extension. 

[0008] Another major limitation of all current genotyping systems, including direct DNA 
sequencing, is that they cannot reliably detect low levels of heterozygotes (Tuveri, R., et aL, Journal 
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of Medical Virology 51: 36-41, 1997; Lau, J.Y., et al., J. Infect. Dis. 171: 281-289, 1995; Forns, X. 5 
et aL 9 J. Clin. Microbiology. 34-10: 2516-2521, 1996) or mixed genotype infections (Tuveri, R., et 
aU Journal of Medical Virology 51: 36-41, 1997; Lau, J.Y., et al. 9 J. Infect Dis. 171: 281-289, 
1995; Forns, X., etal. 9 J. Clin. Microbiology. 34-10: 2516-2521, 1996). 

[0009] International Publication No. WO 96/30545 discloses a method for simultaneously 
analyzing a genetic mutation and a corresponding wild-type sequence within a sample. The method 
utilizes ddNTPs for terminating primer extension. However, the use of ddNTPs increases the 
background of the assay, reducing its sensitivity and capacity to detect low levels of heterozygotes, 
similarly as in WO 91/13075. 

[0010] HCV was recognized as the major etiologic agent of blood borne non-A, non-B 
hepatitis soon after the virus was identified in 1989. As an RNA virus, HCV shows great genetic 
variability, resulting in the existence of types, subtypes and quasispecies. At present, 1 1 types and 
at least 50 subtypes have been described. However, types la, lb, 2a, 2b and 3a have been found to 
be generally the most prevalent (Simmonds, P., Hepatology 21: 570-582, 1995). Subtype lb is the 
most common genotype found in Japan (Okamoto, H., et al 9 J. Gen. Virol. 73: 673, 678, 1992) and 
European countries while subtypes la and lb are the most common genotypes in the United States 
(Lau, J.Y., et al 9 J. Infect Dis. Ill: 281-289, 1995) and Canada, (Andonov, A., et al 9 J. Clin. 
Microbiol. 32: 2031-2034, 1994). Viruses of various genotypes contain different antigenic 
properties, which have potentially important consequences for the development of a vaccine and for 
antibody screening tests. Also, the disease severity and response to interferon may be influenced by 
the virus types and subtypes (Simmonds, Hepatology 21: 570-582, 1995). Subtype lb was reported 
to be associated with a high severity of the disease and low response to interferon (Simmonds, 
Hepatology 21: 570-582, 1995). It is apparent that a rapid, simple, accurate and inexpensive 
genotyping method is urgently needed. 

[0011] Amplification refractory mutation system ("ARMS") (Newton, C.R., et al. 9 Nucl 
Acids Res. 17: 2503-2516), improved the methods used in the prior art for typing the five most 
common genotypes (Pistello, M, et al 9 J. Clin. Microbiol. 32: 232-234, 1994). ARMS was 
developed for PCR detection of any point mutation in DNA using Taq DNA polymerase (Newton, 
C.R., et al 9 Nucl Acids Res. 17: 2503-2516) and is based on the principle that oligonucleotides 
with a mismatched 3-residue would not function as primers in PCR under controlled conditions. In 
some cases, however, the specificity of ARMS was insufficient to give a correct diagnosis. The 
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problem with nonspecific reactivities still remains with the type-specific primer PCR method for 
HCV genotyping, even with the improvement using ARMS. 

[0012] It is apparent that the major cause of the nonspecific reactivities found in these 
assays is related to the use of Taq DNA polymerase due to its lack of 3'^5' exonuclease activity. 
This inaccuracy results in base substitutions, transitions, tranversions, frame shifts or deletion 
mutations during DNA synthesis. Consequently, mispairs can be frequently formed, and Taq 
polymerase would be able to continue synthesizing DNA by addition of the next correct nucleotide 
on the template (Lau, J.Y., et ai, J. Infect. Dis. 171: 281-289, 1995). Even after reaching the end of 
the template, several more nucleotides can be added to the extended primer because most DNA 
polymerases, including Taq and retroviral reverse transcriptase (RT), have a nontemplate-dependent 
DNA synthesis activity, for example, terminal deoxynucleotide transferase activity (Clark, J.M., 
Nucleic Acids Res. 16: 9677; and Patel, P.H., et aL 9 Proc. Natl. Acad. Sci U.S.A. 91: 549-553). 
Therefore, the nonspecific reaction cannot be avoided with either ARMS or the methods based on 
ARMS using Taq DNA polymerase. 

[0013] Two thermostable DNA, polymerases pfu (Pyrococcus furiosus) (Lundberg, K.S., 
et aL, Pyrococcus furiousus. Gene 108: 1-6, 1991), and TLI/Vent (Thermocococcus litoralis\ 
(Neuner, A., et al 9 larch. Microbiol, 153: 205-207), exhibit 3^5' proofreading exonuclease 
activity. This ensures a high degree of amplification fidelity during DNA polymerization. Unlike 
Vent, the pfu 3'->5' exonuclease activity peaks sharply at its optimal polymerization temperature 
(75°C to 80°C), minimizing undesirable primer-degradation activity (Lundberg, K.S., et aL, Gene 
108: 1-6, 1991). Pfu DNA polymerase also does not exhibit terminal deoxynucleotidyl-transferase 
(TDT) activity, which was reported to be involved in the high mutation rate of DNA during DNA 
polymerization. 

[0014] It would be highly desirable to be provided with a simple assay or method to 
overcome many of these limitations of current DNA genotyping systems. 

[0015] It would be highly desirable to be provided with genotypic assays that possess 
greater accuracy and sensitivity for detection of minor drug-resistant subpopulations of HIV- 1 in the 
early stages of resistance evolution. 
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DISCLOSURE OF THE INVENTION 

[0016] One aim of the present invention is to provide a novel primer-specific and mispair 
extension assay (PSMEA) for determining genotypes and subtypes. 

[0017] Another aim of the present invention is to provide a primer-specific and mispair 
extension assay (PSMEA) for detecting nucleotide variations in any known gene sequence using pfu 
DNA polymerase in the presence of an incomplete set of dNTPs and only a single primer. 

[0018] Another aim of the present invention is to provide a tool for reliable detection of 
mixed genotype infection. 

[0019] Another aim of the present invention is to provide a sensitive tool for detecting 
low levels of drug-resistant mutants in patients being treated with antiviral drugs. 

[0020] Another aim of the present invention is to provide a tool for accurate genotyping. 

[0021] In accordance with the present invention, there is provided a primer-specific and 
mispair extension assay for determining genotype. The assay includes: 

a) extending a nucleic acid sequence from a patient sample with a polymerase, more preferably 
a pfu DNA polymerase, using a primer-specific for a genotype to be determined and an incomplete 
set of dNTPs, under suitable conditions for obtaining extension products of the primer, wherein at 
least one of the primer or the dNTPs is labeled; 

b) characterizing the extension products; and 

c) comparing the extension products with known nucleic acid sequences of various genotypes 
for determining the genotype of the nucleic acid sequence extended. 

[0022] Preferably, characterizing the extension products includes separating the extension 
products by size. More preferably, characterizing the extension products further includes 
sequencing the extension products after separating the extension products by size. 

[0023] Alternatively, the assay may further include amplifying the nucleic acid sequence 
before extending the nucleic acid sequence from a patient sample. 

[0024] Preferably, the incomplete set of dNTPs contains two or three different types of 
nucleotides. 

[0025] In a preferred embodiment, the primer is labeled with a radioactive label or a 
fluorescent label. 

[0026] In another embodiment, one of the dNTPs is labeled with a radioactive label or a 
fluorescent label. 
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[0027] In accordance with the present invention, there is provided a novel genotyping 
system, primer-specific and mispair extension analysis (PSMEA) preferably using the unique 3' to 
5' exonuclease proofreading properties of pfu DNA polymerase with an incomplete set of dNTPs 
and only a single primer. 

[0028] Also in accordance with one embodiment of the present invention there is 
provided a primer-specific and mispair extension assay for determining genotype and detecting low 
level mixed genotype injections or heterozygotes. The assay includes: 

a) extending a DNA sequence, amplified from a patient sample, with pfu DNA polymerase 
using a primer-specific for a genotype to be determined and an incomplete set of dNTPs, under 
suitable conditions for extending the primer, wherein at least one of the primer or one of the dNTPs 
is labeled; 

b) separating the extended DNA sequences obtained in step a); 

c) detecting the separated extended DNA sequences; and 

d) comparing the extended DNA sequences with known DNA sequences of various genotypes 
for determining the genotype of the DNA sequences extended. 

[0029] The primer may be end-labeled with a label or one of the dNTPs can be labeled 
with a label. The label can be a radioactive or fluorescent label. 

[0030] Preferably, the steps a), b), c), and d) as described above are automated. 

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 

[0031] FIG. 1 illustrates characteristics of primer-specific and mispair extension by pfu 
and Taq DNA polymerase; 

[0032] FIG. 2 illustrates nucleotide deletions and insertions by PSMEA with primer 6AR- 
1 and 6AR-2 extensions by pfu on templates la and 6a; 

[0033] FIG. 3 illustrates a comparison of the sensitivity between PSMEA and direct DNA 
sequencing in detection of mixed genotypes; 

[0034] FIG. 4 illustrates a typical profile of unlabeled primer extensions by pfu on HCV 
templates for types la, lb, 2a, 2b, 3a and 3b using 32 P-labeled dCTP and dGTP; 

[0035] FIGS. 5 A and 5B illustrate the detection of HIV- 1 drug-resistant mutant V82A 
and wild type V82 using a universal primer and four sets of dNTPs; 
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[0036] FIGS. 6 A and 6B illustrate the detection of HIV- 1 drug-resistant mutant L63P and 
wild type L63 using matched and mismatched pairs at the 3' end of the primers in PSMEA; 

[0037] FIGS. 7A and 7B illustrate the detection of HIV-1 drug-resistant L90M and wild 
type L90 using matched and mismatched pairs at the 3' end of the primers; and 

[0038] FIG. 8 illustrates the sensitivity of PSMEA for detecting low levels of HIV-1 
drug-resistant mutant with a defined mix of mutant V82I and wild type V82. 

DETAILED DESCRIPTION OF THE INVENTION 

[0039] In accordance with one embodiment of the invention, a novel primer-specific and 
mispair extension assay (PSMEA) is provided for determining genotypes and more preferably, 
Hepatitis C virus (HCV) genotypes and subtypes. 

[0040] PSMEA is used to detect nucleotide variations in any known gene sequence using 
pfu DNA polymerase in the presence of an incomplete set of dNTPs. To test the feasibility of 
PSMEA, the 5' untranslated region ("UR") of the HCV genome, known for being highly 
conservative, was used as a model for analysis of the nucleotide variation in determining the type 
and subtype of the virus. The results demonstrate that PSMEA is a rapid, simple and accurate 
method for HCV genotyping. 

[0041] In the 5' UR of the HCV genome, six major genotypes and some subtypes can be 
classified by the nucleotide variation in this region (Simmonds, P., Hepatology 21: 570-582, 1995; 
Stuyver, L., et al.,J. Clin. Microbiol. 34: 2259-2266, 1996). 

[0042] The PSMEA of the present invention is a simple method with great potential for 
accurately detecting nucleotide mutations and which may be used for detecting nucleotide variations 
in any known gene sequence. 

[0043] PSMEA is based on the unique properties of 3^5' proofreading activity in a 
reaction with an incomplete set of dNTPs. Under such reaction conditions in accordance with one 
embodiment of the invention, pfu is extremely discriminating in nucleotide incorporation and 
proofreading at the initiation step of DNA synthesis, allowing for an accurate detection of 
nucleotide variation and heterozygotes in PSMEA. 

[0044] It is known that under this reaction condition mispair formation and extension 
occur during DNA synthesis when using reverse transcriptase and DNA polymerases, including 
enzymes that exhibit 3'— >5 ? proofreading exonuclease activity (Perinno, F., et al 9 Proc. Natl. Acad. 
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ScL USA. 86: 8343-8347, 1989; Reha-Krantz, L.J., et aL, Proc. Natl Acad. Sci. USA. 88: 2417- 
2421, 1991). However, the frequency of mispair formation and extension depends on whether the 
polymerase possesses 3'->5' exonuclease activity, the concentration of nucleotide substrates and the 
composition of the mispairs (Reha-Krantz, et aL, Proc. Natl. Acad. Sci. USA. 88: 2417-2421, 1991). 
Accordingly, the characteristics of primer-specific and mispair extension by thermostable DNA 
polymerases including pfu (Pyrococcus furiosus) (Lundberg, K.S., etal., Gene 108: 1-6, 1991), Taq 
{Thermus aquaticus) (Saiki, R.K., et aL, Science. 239: 487, 1988), and TLI/Vent (Thermocococcus 
litoralis) (Neuner, A., et aL, larch. Microbiol., 153: 205-207) were further investigated. Several 
characteristics of primer-specific and mispair extension by pfu were observed and found to be useful 
for reliable detection of nucleotide variation, deletion and insertion, as well as heterozygotes. 

[0045] In accordance with a preferred embodiment of the invention, the accuracy was 
evaluated by assaying the nucleotide variations between genotypes in the 5' untranslated region (5 ! 
UR) of the hepatitis C virus (HCV) genome. Mixed infections with more than one genotype of 
HCV were used for comparison of the sensitivity of PSMEA with other assays in the detection of 
heterozygotes. The feasibility of the method of the present invention for large cohort studies was 
demonstrated by genotyping a total of two hundred and forty-five (245) HCV isolates. The results 
show that PSMEA is an extraordinarily accurate system for identifying nucleotide variation, 
genotyping and detecting heterozygous molecules, and is readily applicable for routine use. 

[0046] The design of PSMEA in accordance with a preferred embodiment of the 
invention is based on a single primer extension by the pfu DNA polymerase in the presence of only 
dCTP and dGTP, permitting accurate detection of nucleotide variations in the 5' untranslated region 
(5'UR) of the HCV genome. In accordance with one embodiment of the invention, the HCV 
genotypes from ninety-six (96) patients and blood donors with HCV infection were determined by 
PSMEA. Seventy-four (74) of the samples were also genotyped by either the line probe assay 
("LiPA") or restriction fragment length polymorphism ("RFLP") methods. Genotypes were 
confirmed by nucleotide sequencing as required. HCV Isolates, including types and subtypes la, 
lb, 2a, 2b, 2c, 3a, 3b, 4a, 5a, and 6a, were clearly identified by the PSMEA of the present invention. 
All of the types and subtypes determined by PSMEA were matched with those identified by LiPA 
or RFLP. Five (5) isolates of subtypes la and lb that could not be typed by LiPA were clearly 
identified by the PSMEA of the present invention. 
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[0047] The primers used in the present invention were designed to meet the following 
requirements: 1) the sequence of primer binding site on template should be a type- or subtype- 
specific; 2) primers used for PSMEA should exhibit a similar melting temperature. In accordance 
with the present invention, 1 1 primers were designed for HCV genotyping with PSMEA (Table 1). 

[0048] FIG. 1 illustrates preferred characteristics of primer-specific and mispair extension 

by pju. 

[0049] FIG. 1 , column I illustrates 32 P-labeled primer extensions by pfu on template HCV 
genotype lb in the presence of dCTP and dGTP (FIG. 1, columns 1-1 and 1-2) used as markers for 
the length of primers and extended products. Column H illustrates the use of 32 P-labeled dNTPs 
instead of labeled primers for PSMEA under the same reaction conditions, showing the difference 
in primer extension between HCV genotypes la and lb (FIG. 1, columns H-l, II-2, II-3 and II-4). 
Column HI illustrates mispair formation and extension by Taq on template la with 32 P-labeled 
primers 1AB and 1BR (FIG. 1, columns HI-1 and III-2). FIG. 1, parts I', II* and III' illustrate the 
nucleotide sequences extended on templates of HCV genotypes la and lb. Primer 1AB extended 
on the antisense strand of templates la and lb. X, XX and XXX represent the sites of nucleotide 
mismatches that terminated primer extension. -> represents a nucleotide at the 3' end of the primer 
that is complementary to the opposite nucleotide in the template. (A), (C), (G) or (T) denotes the 
position of the nucleotide when a mispair is produced. -113, -108 and -99 are the nucleotide 
positions in 5'UR of HCV. The underlined sequence indicates the primer binding sites. The signs 
in this figure are used for all other figures. 

[0050] A mismatched pair with nucleotide A at position -99 of the template HCV 
genotype la that is opposite the first nucleotide to be incorporated at the 3' end of primer, could not 
be produced and subsequently, aborted the primer extension by pfu in the presence of dCTP and 
dGTP (FIG. 1, column 1-1). Under the same reaction conditions, the primer extended on the 
template of genotype lb because nucleotide A becomes G at position -99 (FIG. 1, column 1-2), 
differentiating between genotypes la and lb with this single nucleotide variation that is the only 
difference in the 5' UR between the two genotypes. 

[0051] FIG. 1 illustrates the alignment of the 5' UR nucleotide sequences of HCV 
subtypes la and lb, showing the homologous sequence( — ) and the difference of the nucleotide at 
position -99 (A for la and G for lb). The primer extension stopped at position -99 where a 
mismatched pair(s) exists (X). The arrow represents a nucleotide at 3 ? end of the primers that is 
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complementary to the opposite nucleotide in the template. The parenthesis ( ) denotes that a 
nucleotide mispair occurred. 

[0052] The absence of mispair formation and extension at position -99 is due to pfu 3'-»5' 
exonuclease proofreading activity that sharply peaks up at its optimal polymerization temperature, 
removing the mismatched nucleotide added at the initiation step of primer extension. It is 
apparently different from the 3'->5' exonuclease proofreading activity of other DNA polymerases 
such as pol-a and T4 in the primer mispair extension reaction. The mispair at the initiation site of 
DNA synthesizing can be produced with high frequency by these DNA polymerases in the primer 
mispair formation and extension reaction (Perinno, F., et al, Proc. Natl. Acad. Sci. USA. 86:8343- 
8347, 1989; Reha-Krantz, LJ., etal, Proc. Natl. Acad. Sci. USA. 88:2417-2421, 1991). 

[0053] Taq DNA polymerase without 3'->5' exonuclease proofreading activities is used in 
virtually all PCR-based genotyping assays (Okamoto, H., et al, J. Gen. Virol. 73: 673, 678, 1992; 
Newton, C.R., et al, Nucl. Acids Res., 17: 2503-2516). The unreliability caused by cross-reactivity 
and wrong priming has been a big concern (Lau, J.Y., et al, J. Infect. Dis. 171: 281-289, 1995; and 
Forns, X., et al,J. Clin. Microbiology. 34-10: 2516-2521, 1996). When Taq is used instead oipfu, 
with the other conditions of the assay being the same, the primer 1BR extension took place on 
template la despite a mismatched pair existing at primer extension initiation site (-99) (FIG. 1, 
column III-2). This clearly indicates that the major cause of these nonspecific reactivities is due to 
the infidelity of Taq during the initiation of DNA synthesis (Perinno, F., et al, Proc. Natl. Acad. 
Sci. USA. 86: 8343-8347, 1989). 

[0054] In addition, since most DNA polymerases including Taq and reverse transcriptase 
of retroviruses exhibit a nontemplate-dependent DNA synthesis activity, for example, terminal 
deoxynucleotide transferase activity ("TDT"), several more nucleotides can be added to the 3' end 
of the primer, for example, single strand DNA (Clark, J.M., Nucleic Acids Res. 16: 9677; Patel, 
P.H., et al, Proc. Natl. Acad. Sci. U.S.A. 91: 549-553). Thus, Taq would be able to continue 
synthesizing DNA by addition of a correct paired nucleotide next to the mispaired nucleotide, in 
particular, to the nucleotide in the A:C mispair that can be formed and extended more efficiently 
than other mispairs (Perinno, F., et al, Proc. Natl. Acad. Sci. USA. 86: 8343-8347, 1989; Newton, 
C.R., et al., Nucl Acids Res. 17: 2503-2516). A DNA strand with mismatched nucleotides would 
be generated and then act as a template that would be subsequently re-amplified, generating a large 
number of nonspecific DNA molecules. Therefore, the nonspecific reaction cannot be completely 
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avoided with the methods based on DNA amplification with primer-specific PCR using Taq DNA 
polymerase. In contrast, in PSMEA, only a single primer is used, so that the extended primer 
cannot be re-amplified, and any mispairs, including A:C in front of the 3' end of the primer, could 
completely stop primer extension. TLI/Vent DNA polymerase exhibits 3 ! ->5' proofreading 
exonuclease activity; however, unlike pfu, TLI/Vent severely degraded single-stranded DNA, for 
example, a primer, in PSMEA. 

[0055] In FIG. 1, column 1-1, primer extension was performed using pfu with labeled 
primers (FIG. 1, column l-\\pfu with labeled dNTPs (FIG. 1, column H-l), and Taq with labeled 
primers (FIG. 1, column IH-1). The symbol t represents where the nucleotide incorporation starts. 
The symbol (X) denotes a stop of primer extension caused by the nucleotide mispair(s). 

[0056] The second feature of specific and mispair extension by pfu is that one correct 
pairing followed by more than one mispair in front of the 3' end of the primer is not sufficient for 
primer extension (FIG. 2, columns 1-1 and 1-2). At least two consecutive correct pairings are 
required (FIG. 2, columns 1-4 and II-2). This provides a means to identify nucleotide deletion and 
insertion as well as multiple nucleotide variations or mutations in PSMEA. 

[0057] In FIG. 2, Primer 6AR-1 could not be extended with template HCV genotype la 
when using either dCTP plus dTTP, or dCTP plus dGTP as substrates due to a CA deletion (..), 
resulting either in a mispair at nucleotide position -145 in front of the 3' end of primer (FIG. 1, 
column 1-1) or in only one matched pairing at the position (FIG. 1, column 1-2). Primer 6AR-1 
could not be extended on template HCV genotype 6a when using dCTP and dGTP (FIG. 1, column 
1-3). However, the primer was extended by three bases with template HCV genotype 6a using 
dGTP and dTTP which matched the nucleotides in the CA insertion (CA) in template HCV 
genotype 6a (FIG. 1, column 1-4). Primer 6AR-2 was extended when using template HCV 
genotype 6a (FIG. 1, columns H-l and II-2), but not with template HCV genotype la using either 
dATP, dCTP and dGTP or only dGTP (FIG. 1, columns E-3 and II-4). The symbol denotes the 
removal of the first nucleotide mismatched at the 3 ? end of the primer. 

[0058] As seen in FIG. 2, there is a unique CA insertion in the 5* UR of HCV genotype 
6a. Use of the CA insertion can differentiate between HCV genotype 6a and other genotypes. In 
the instant application, the CA insertion is used as an example to show how nucleotide deletion and 
insertion could be identified. Based on the flanking nucleotide sequence of the CA insertion, two 
primers were designed. A first primer 6AR-1, located in the region before a CA insertion in HCV 
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genotype 6a, was extended only when using the correctly paired dNTPs, for example, dGTP and 
dTTP (FIG. 2, column 1-4), but not those dNTPs (dCTP and dGTP) (FIG. 2, column 1-3) that 
mismatched with the nucleotide A in the CA insertion. The second primer 6AR-2 was designed 
with the first nucleotide at its 3' end being matched with the first nucleotide A in the CA insertion of 
template HCV genotype 6a. Thus, the primer extended two bases or 11 bases on template HCV 
genotype 6a, depending on the substrates of dNTPs used (FIG. 2, columns II- 1 and II-2), but not on 
template HCV genotype la due to a CA deletion that results in a 3' mispaired residue being 
removed after primer binding (FIG. 2, columns II-3 and II-4). 

[0059] In accordance with the present invention, multiple nucleotide variations or 
mutations can be identified. For example, a unique TCA motif that is specific to HCV genotype 3a 
and 3b could be identified and is used for differentiation between the two genotypes and other 
genotypes. 

[0060] Like mispair formation and extension by T4 DNA polymerase (Wilber, J. C, et 
al, M. S. Reverse transcriptase-PCR for hepatitis C virus RNA, p. 327-331; D. H. Persing, T.F. 
Smith, F.C. Tenover, and T.J. White (ed.), Diagnostic Molecular Microbiology: Principles and 
Applications, American Society for Microbiology, Washington D.C. 1993), any two or more 
consecutive mispairs could completely terminate primer extension by pfu because of its 3'-»5' 
proofreading activity (FIG. 1, column H-2 and FIG. 2, columns 1-3 and II-2). It was also found that 
two or more mispairs, separated by one or two correct base pairs (FIG. 1, column II-l and FIG. 2, 
column It-1) could also terminate primer extension by pju. Use of the termination point caused by 
these mispairs, as well as primer-specific and mispair extensions on templates by pfu, provided 
reliable information on nucleotide sequence in the given region of the 5' UR of HCV. 

[0061] The highest molecular weight band shown on the sequencing gels represents the 
longest sequence extended to the termination point that is specific to each of the HCV genotypes. 
As indicated in FIG. 1, column II, for example, the highest band represents the primer 1AB 
extended to the nucleotide C at nucleotide position -100 before the termination point at position -99 
and -98 that would produce two mispairs with the two adjacent nucleotides A in template HCV 
genotype la, while the highest band represents the primer extended to the nucleotide G at -99 before 
the termination point at position -98 and -96 in template HCV genotype lb, showing the single 
nucleotide difference between HCV genotypes la and lb. This is another way to identify a single 
point mutation or variation in PSMEA in accordance with the present invention. 
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[0062] Whatever dNTPs, for example, one two or three of the four dNTPs, are chosen, 
they must follow the "instruction" with the characteristics of primer-specific and mispair extension 
bypfu. 

[0063] The nucleotide incorporation rate of pfu is one fifth (1/5) of that of Tag. Thus, 
pfu-based PCR applications require a minimum extension time of 2.0 minutes/kb. The efficiency of 
nucleotide incorporation by pfu is high enough for PSMEA in which only less than 20 bases of 
extension are required per cycle in the reaction. Over 50% of the excess primer with less template 
(molecular ratio of primer and template 10:1) could be extended in a 20 cycle reaction, generating 
strong signals with either 32 P-labeled primer or dNTPs. Thus, PSMEA offers not only an advantage 
of superior accuracy over current indirect DNA genotyping systems, but an extraordinary sensitivity 
for detection of mixed infections with different genotypes of HCV. 

[0064] Direct DNA sequencing is routinely used as the gold standard method for 
confirmation of the results from other assays (Tuveri, R., et ai, Journal of Medical Virology 51: 36- 
41, 1997; (Forns, X., et al. 9 J. Clin. Microbiology. 34-10: 2516-2521, 1996). In the present 
application, the sensitivity of PSMEA for identifying mixed infections was evaluated with direct 
DNA sequencing methods. In an experiment, the cDNAs (PCR products) from HCV genotypes lb 
and 2a isolates were mixed in different proportions to mimic HCV mixed infections and 
heterozygotes. FIG. 3 shows that HCV genotype 2a could be clearly identified by direct DNA 
sequencing only when it reached to 50% in proportion in the mix. When HCV genotype 2a 
molecules consisted of less than 25%, only some of nucleotide variation, points could be manually 
recognized but was not conclusive for the identification of which genotype it was. However, HCV 
genotype 2a consisting as low as about 3% in the mix was clearly detected by PSMEA, showing 
approximately 10 times more sensitivity than the direct DNA sequencing system. 

[0065] To further confirm the sensitivity of PSMEA, three samples identified having two 
or three genotypes by PSMEA were analyzed by direct DNA sequencing (FIG. 3, column B) and 
the reverse hybridization method. Results reveal that the presence of HCV genotype 2b was 
confirmed by the direct DNA sequencing method, but not by the reverse hybridization method in 
the sample containing la as dominant population. 

[0066] FIG. 3 presents autoradiography results in column A and the date of a computer 
analysis of automated sequencing in column A 1 . FIG. 3, part A, illustrates the different proportions 
of genotypes lb and 2a: 0% to 100% (I), 50% to 50% (II), 75% to 25% (HI), 87.5% to 12.5% (IV), 
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93.75% to 6.25% (V), and 96.875% to 3.125% (IV) in the mix analyzed with PSMEA FIG. 3 5 part 
A and direct DNA sequencing FIG. 3, part A'. Two HCV genotypes, la and 2b, were identified in a 
thalassaemia patient sample by PSMEA FIG. 3, part B. N in FIG. 3, part B represents the negative 
reaction with primer 6AR and the sample from the thalassemia patient. 

[0067] To evaluate the feasibility of PSMEA for large cohort studies, a total of two 
hundred and forty-five (245) samples from HCV seropositive blood donors and patients with 
chronic hepatitis were genotyped by this assay. The genotypes determined by PSMEA included la, 
lb, 2a, 2b, 2c, 3a, 3b, 4, 5a and 6a. The typeable rate of these samples with PSMEA was 95.5%. 
Eighty (80) of them were also typed with other indirect or direct DNA sequencing genotyping 
methods. The genotyping results from PSMEA were in 90-100% concordance with that from other 
genotyping methods including LiPA , RFLP and direct DNA sequencing. Less than 5% of 
untypeable samples were sequenced, indicating that those isolates being untypeable by PSMEA 
were either HCV mutants or unclassified genotypes. Results have proven a great utility of PSMEA 
for large cohort studies on viral genotyping. 

[0068] PSMEA has been further developed with an automatic and colorimetric format, 
creating a great capacity to quickly genotype a large number of HCV isolates not previously 
possible. Therefore, PSMEA has a great potential for identifying nucleotide variations and 
heterozygotes in many areas such as virology, bacteriology, human genetics, epidemiology and 
legal medicine. 

[0069] When a type- or subtype-specific sequence is available for designing the primer, 
for example primers 2A and 3A, no cross-reactivities with other genotypes are observed in PSMEA. 
Consistent results were obtained with primers 1 AB, 2B, 2C, and 3R (Table 2). 

[0070] Primer 1AB is universal for HCV genotypes la and lb (FIG. 1, column III), 
differentiating the two genotypes from other genotypes except genotype 6a due to the homologous 
sequence in the region -131 and -99 between genotypes la/lb and 6a. Fortunately, a nucleotide A 
in the unique CA insertion between the nucleotide positions -145 and -144 in the 5' UR of 6a could 
be used as a first paired nucleotide at the 3' end of primer 6AR, thus, the primer specifically 
extended on template 6a, but not on other genotypes (FIG. 2, column I). 

[0071] The primer could be bound on other genotypes, but could not be extended due to 
the CA deletion in these genotypes as compared with the insertion in 6a 5'UR. It was thus found 
that, for example, the primer 6AR-1 could not be extended because there was a single C at position - 
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145 which could be paired in the presence of dTTP and dGTP, followed by three Ts in the template 
HCA genotype la, in the reaction (FIG. 2, column 1-1). However, the primer extended on template 
6a in the same reaction conditions due to the presence of the two consecutive pairings C:G and A:T 
at positions between -145 and -144 (FIG. 2, column 1-4). Accordingly, the primer 6AR2 could be 
extended longer on template 6a when using three dNTPs (dATP, dGTP and dCTP), showing a 
stronger signal to differentiate 6a from other genotypes (FIG. 2, column II-l). In the same manner, 
primer 5AR designed with a nucleotide variation at -236 was subtype-specific. 

[0072] As shown in Table 2, primer 3B, that was originally designed for identification of 
genotype 3b, exhibited cross-reactivity with 4e in PSMEA due to the homologous sequence in the 
region -175 to -149 chosen between the two genotypes. Thus, a small region (-99 to -79) that 
contains a unique TCA motif in 5TJR of genotypes 3a and 3b was used for designing the primer 3R. 
Thus, by using primers 3A and 3R, genotypes 3a and 3b could be differentiated from other 
genotypes. Similarly, three genotypes la, lb and 6a could clearly be identified with primer 1 AB , 
1BR and 6AR that exhibited a cross-reactivity with 1 AB. A total 1 1 genotypes, including six major 
genotypes (la, lb, 2a, 2b, 3a, 3b) and some uncommon genotypes (2c, 5a, 6a, 4a and 4e), in Canada 
could be identified by PSMEA using 1 1 primers (Table 2). 

[0073] Extension of the mismatched 3 f termini of DNA is a major determinant of the 
infidelity of the DNA polymerases that have no 3 ? ->5 ! exonuclease activity (Pistello, M. et aL, J. 
Clin. Microbiol 32: 232-234). With the PSMEA of the present invention, by using 3'->5' 
exonuclease proofreading activity, high detection specificity for nucleotide mutation or variation in 
a known gene has been achieved. A significant advantage of this assay is that a single nucleotide 
variation, deletion and insertion can be accurately detected. In the genome of many natural virus 
mutants or drug-restistant mutants, there may be only a single nucleotide mutation that has potential 
genetic or clinical significance. For example, a substitution C for A at nucleotide position 1814 that 
destroys the precore initiation codon, will prevent production of HB VeAg. In some drug-resistant 
mutants, a single nucleotide mutation could cause a failure of an antiviral therapy. PSMEA of the 
present invention can thus be used for rapid screening of those mutants. 

[0074] A single point mutation could be associated with genetic disease in humans, such 
as a single point mutation, resulting in an amino acid substitution (C282Y) in the gene, HLA-H for 
haemochromatosis, which was reported to be involved in iron metabolism disorder. Such single 
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point mutations are frequently found in viruses. For example, a substitution C for A results in loss 
of precore initiation codon of hepatitis B virus, preventing e-antigen (HBVeAg) synthesis. 

[0075] The PSMEA of the present invention can be modified so as to replace 
radioactivity by a detectable label. Such a nonradioactive assay could be in the form of a 
colorimetric PSMEA in a microtiter-plate format. 

[0076] The present invention will be more readily understood by referring to the 
following examples, which are given to illustrate the invention rather than to limit its scope. 

EXAMPLE I 
DETERMINATION OF HCV GENOTYPES 

[0077] Primer extension reactions contained 20 ng primer, 20-30 ng PCR product, 20 mM 
of each dCTP and dGTP, 10 mCi of each 32 p-labeled dCTP and dGTP, 1.25 units pfu DNA 
polymerase and 10 ml 10X pfu reaction buffer (PDI). When 5'-end 32 p-labeled primers were used, 
the 32 p-labeled dCTP and dGTP were omitted, and 100 uM of each dCTP and dGTP were used in 
the reactions. The primer extensions were performed in a reaction volume of 100 ml in a 
thermocycler (Perkin Elmer, GeneAmp 9600). Twenty (20) cycles consisting of a denaturation step 
at 94°C for 20 seconds, an annealing step at 64°C for 20 seconds and an extension step at 72°C for 
35 seconds were performed. One microliter of the primer extension products was mixed with 1 uL 
of the sequencing stop solution (Pharmacia Biotech) and electrophoresed on 8% polyacrylamide 8M 
urea TBE gels for one hour. Extension products were visualized by autoradiography. 

[0078] For PCR amplification and sequence analysis, viral RNA was isolated from 100 
ml of serum by treatment with RNAzol B (Biotecx Laboratories, Houston, Tex.) as previously 
described in Wilber, J. C, Johnson, P. J., and Urdea, M. S. Reverse transcriptase-PCR for hepatitis 
C virus RNA, p. 327-331, and in D. H. Persing, T.F. Smith, F.C. Tenover, and T.J. White (ed.), 
Diagnostic Molecular Microbiology: Principles and Applications. American Society for 
Microbiology, Washington D.C. 1993. RT-PCR was performed as described by Bernier et al. 
(Bemier, L., et al., J. Clin. Microbiol. 34: 2815-1818), with a set of primers that target highly 
conserved domains within the 5-UR. The nucleotides and primers were removed from the PCR 
products with the QIAquick™ PCR Purification Kit (QIAGEN), using the procedure recommended 
by the product supplier. These purified PCR products were used for primer extension and 
automated sequencing analysis. 
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[0079] HCV isolates from patients with HCV infection were typed either by the improved 
Inno LiPA kit D™ using the procedure provided by the supplier (Innogenetics N.V., Belgium) or by 
RFLP analysis. For RFLP analysis, HCV genotypes were determined by cleavage of the PCR 
products with restriction enzymes BstNI, Bsr, Hinfl, Maelll, Haelll, BstUl and ScrFl. Digests were 
analyzed by gel electrophoresis and ethidium bromide staining. 

EXAMPLE II 
PRIMER DESIGN AND PSMEA PROCEDURE 
[0080] In accordance with a preferred embodiment of the invention, the development of 
PSMEA is based on a single primer extension in the 5'-UR of the HCV genome using pfu DNA 
polymerase in the presence of only dCTP and dGTP in the reaction. Thus, under these reaction 
conditions, primer extension occurs when only the G and/or C nucleotides are added immediately 
downstream of the 3 ? end of the primer, since the incorporation of A or T nucleotide would in many 
instances be prevented by the 3'-5 ? exonuclease activity of pfu. The primers were designed to meet 
the following requirements: 

1. They must have type-specific sequences; and 

2. All primers used for PSMEA should exhibit a similar melting temperature. 

The nucleotide sequences of the primers used in this study are shown in Table 1. In the 
PSMEA procedure, the primer extension reactions contained 20 ng primer, 20-30 ng PCR product, 
20 mM of each dCTP and dGTP, 10 mCi of each 32 P-labeled dCTP and dGTP, 1.25 units pfu DNA 
polymerase and 10 ml 10X pfu reaction buffer (Stratagene). When 5 f -end 32 P-labeled primers were 
used, the 32 P-labeled dCTP and dGTP were omitted, and 100 nM of each dCTP and dGTP were 
used in the reactions. The primer extensions were performed in a reaction volume of 100 ml in a 
thermocycler (Perkin Elmer, GeneAmp 9600). Twenty cycles of 94°C for 20 seconds, 64°C for 20 
seconds and 72°C for 35 seconds were performed. One microliter of the primer extension products 
were mixed with 1 nL of the sequencing stop solution (Pharmacia Biotech) and electrophoresed on 
8% polyacrylamide 8M urea TBE gels. Extension products were visualized by autoradiography. 
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EXAMPLE HI 
PCR AMPLIFICATION AND SEQUENCE ANALYSIS 
[0081] Viral RNA was isolated from 100 ml of serum by treatment with RNAzol B 
(Biotecx Laboratories, Houston, Tex.) as previously described in Wilber, J. C, Johnson, P. J., and 
Urdea, M. S. Reverse transcriptase-PCR for hepatitis C virus RNA, p. 327-331, and in D. H. 
Persing, T.F. Smith, F.C. Tenover, and T.J. White (ed.), Diagnostic Molecular Microbiology: 
Principles and Applications. American Society for Microbiology, Washington D.C. 1993. RT-PCR 
was performed as described by Bernier, L., et al, J. Clin. Microbiol. 34: 2815-1818, with a set of 
primers (see Table 1) that target highly conserved domains within the 5'-UR. If a second round 
PCR was necessary to provide sufficient cDNA for PSMEA, a pair of nested primers (sense primer 
-21 1 to -192 and antisense primer -91 to -74) was used. The nucleotides and primers were removed 
from the PCR products with the QIAquick™ PCR Purification Kit (QIAGEN), using the procedure 
recommended by the manufacturer. These purified PCR products were used for PSMEA and for 
automated sequencing analysis. 

EXAMPLE IV 

GENOTYPING OF HCV ISOLATES BY LiPA AND RFLP 
[0082] HCV isolates from patients with HCV infection were typed either by the improved 
Inno LiPA kit H™ using the procedure provided by the supplier (Innogenetics N.V., Belgium) or by 
RFLP analysis. For RFLP analysis, HCV genotypes were determined by cleavage of the PCR 
products with restriction enzymes BstNI, Bsr, Hinfl, Maelll, Haelll, BstUl and ScrFl, (Andonov, A., 
et al, J. Clin. Microbiol. 32: 2031-2034, 1994). Digests were analyzed by gel electrophoresis and 
ethidium bromide staining. 

EXAMPLE V 
The accuracy and reliability of PSMEA 
[0083] To evaluate the accuracy and reliability of PSMEA, 51 HCV isolates from HCV 
infected individuals in Montreal, Canada, typed by RFLP analysis in the 5' UR (Andonov, A., et al, 
J. Clin. Microbiol. 32: 2031-2034, 1994) were analyzed with PSMEA in a double blind study. The 
subtypes 2a and 2c were grouped together when RFLP was used for genotyping of the 51 isolates, 
since some 2c variants share common ScrFI cleavage sites with 2a variants. 



20 



[0084] A primer designed with sequence -133 to -113 from this isolate has shown no 
cross-reactivity with other types described in the present study (Table 2), suggesting that most 
isolates determined as 2a/2c by RFLP have a typical 2a-specific sequence, and that the primer 2A 
can discriminate the majority of subtype 2a from subtype 2c. The results from the analysis of these 
51 isolates indicate that 100% of types and subtypes for the isolates determined by PSMEA were 
matched with the types and subtypes identified with RFLP. In these isolates, there were 14- la, 14- 
lb, 5-2a, 5-2b, l-2c, 10-3a, l-3b, and l-6a. LiPA was reported to reliably type the most common 
genotypes, including some subtypes (Simmonds, P., Variability of Hepatitis C. Virus. Hepatology 
21: 570-582, 1995). Fifteen (15) isolates were typed by LiPA and evaluated with PSMEA in 
accordance with the present invention. They include 5-la, 6-lb, l-2a, l-2b, l-3a, l-3b. The results 
from PSMEA showed a 100% agreement with that from LiPA. 

[0085] A group of five (5) HCV isolates that failed to be typed or subtyped by other 
methods were clearly identified as la or lb by PSMEA. The results were confirmed by direct DNA 
sequencing using their PCR products from the 5' UR region (-211 to -71) (FIG. 4), suggesting that 
results from PSMEA is reliable. 

[0086] In the present application, the majority of the samples were typed with PSMEA 
using nonlabeled primers and 32 p-labeled dNTPs. Results obtained using 32 p-labeled dNTPs with 
unlabeled primers showed the typical patterns of the primer extensions as seen in the results from 
the reaction with 32 p-labeled primers and nonlabeled dNTPs. 

[0087] Accordingly the new genotyping assay, primer-specific and mispair extension 
assay (PSMEA), of the present invention was used to genotype HCV and to detect mixed infections. 
A total of one hundred and forty-six (146) HCV isolates were typed and analyzed with PSMEA, 
showing that nine of 110 isolates (8.2%) from HCV positive blood donors and six of 36 isolates 
(16.7%) from thalassaemia patients were found to contain more than one genotype. The results 
were confirmed and compared with other current assays including direct DNA sequencing and line 
probe assay (LiPA). PSMEA of the present invention was found to be more reliable than other 
assays in detecting mixed infection. 
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EXAMPLE VI 
Feasibility of PSMEA for large cohort studies 

[0088] Some genotypes including la, lb, 2a, and 2b show a broad geographical 
distribution, and the infection frequency of subtypes la and lb can be over 50% in blood donors and 
patients with chronic hepatitis in the United States (Lau, J.Y., et al 9 J. Infect Dis. 171: 281-289, 
1995), Canada (Bernier, L., et al., J. Clin. Microbiol 34: 2815-1818, and Andonov, A., et al, J. 
Clin. Microbiol. 32: 2031-2034, 1994), and most European countries (Simmonds, P., Hepatology 
21: 570-582. 1995, and Stuyver, L., Rosseau, et al., J. Clin. Microbiol. 34: 2259-2266, 1996). 
Other genotypes, such as 3a and 3b are less common than la and lb in those countries. However, 
3a seems to be quite frequently found in Canada. Genotypes 4, 5a and 6a are only found in specific 
geographical regions in the Middle East, South Africa and Hong Kong, respectively, but were also 
infrequently found in some areas of Canada. It is apparent that for each region, a strategy for 
genotyping a large number of HCV isolates by PSMEA has to be designed, based on the genotype 
distribution and infection frequency with the population. For example, in Canada, over 60% of 
HCV isolates are genotypes la, lb, which can be identified by primers 1 AB, 1BR. However, since 
genotype 6a was frequently found in some areas of Canada and primer 1 AB was cross-reacted with 
6a, thus all isolates diagnosed as la should be retested with primer 6AR for screening of genotype 
6a in the first round of testing with PSMEA. The nontypeable isolates using the three primers 
should be retested with primers 2A and 2B. Thus, genotypes 2a and 2b (over 15% of total isolates) 
can be determined in the second round of testing. The remaining nontypeable isolates should be 
screened with primers 3 A, 3B and 3R. Genotypes 3a and 3b (over 15% of total isolates) can be 
identified in the third round of testing. After the third round of testing, the rest of the isolates (less 
than 10% of total isolates) that cannot be typed by these primers would include some subtypes of 
type 4, subtype 5 a, subtype 2c or other genotypes. Table 3 identifies the infection frequencies of the 
major HCV genotypes identified by PSMEA, indicating that it is practical and feasible for 
genotyping a large number of isolates for epidemioloy and clinical studies. 

[0089] All current genotyping assays such as those for HCV, including direct DNA 
sequencing, are not suitable for detection of mixed infections because they are designed for 
detection of the population-dominant genotype. As indicated in Table 3, HCV mixed infection rate 
was higher than expected. A reliable detection of HCV mixed infections by PSMEA in different 
populations with HCV infection is thus reported. 
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EXAMPLE VII 
DETERMINATION OF HIV-1 MUTANTS 

[0090] HIV positive sequential plasma samples were collected from patients on the 
combination therapy with RT and protease inhibitors for greater than three months. 

[0091] Viral RNA from samples was purified using the QIAamp™ Viral RNA kit. The 
Pharmacia Biotech First-Strand cDNA Synthesis kit was then used for cDNA synthesis. PCR was 
performed to amplify the region containing the entire protease and RT regions of HIV-1. The PCR 
products were purified using the QIA quick™ PCR Purification kit for genotypic analysis with 
PSMEA, direct DNA sequencing and cloning. 

[0092] PSMEA was performed with the procedure described above. This assay has been 
developed as a semi-automated system. Accordingly, a Cy5.5 dye-labeled primer (at the 5 '-end of 
the primer) was used instead of a 32 P-labeled primer. Primer extension reactions contained 20 ng of 
5'-end 5.5 dye-labeled primer, 20-30 ng of PCR product, 20 jiM of each dNTP, 1.25 units of pfu 
DNA polymerase and 10 |liL of 10X pfu reaction buffer (PDI). Primer extensions were performed in 
a 100 ^iL reaction volume in a thermocycler (Perkin Elmer, GeneAmp 9600). Twenty cycles of 
94°C for 20 seconds, 64°C for 20 seconds and 72°C for 35 seconds were performed. One microliter 
of the primer extension products was mixed with 1 |xL of the sequencing stop solution (Pharmacia 
Biotech) and electrophoresed on 6% polyacrylamide 8M urea TBE mini gels for 10 minutes. 
Extension products were analyzed by automated DNA fragment length polymorphism analysis 
software in the OpenGene™ automated DNA sequencing system (Visible Genetics). 

[0093] Both direct DNA sequencing and population based sequencing were performed 
with an automated DNA sequencer (Visible Genetics) and DNA sequencing kits (Visible Genetics). 
For the population based DNA sequencing, the PCR products were directly cloned into the 
TOPO™ TA vector (Invitrogen Corp.). The TTV sequence was amplified from positive colonies of 
Escherichia coli using Ml 3 primers (Pharmacia Biotech). The re-amplified PCR products were 
sequenced using the same procedure as used for direct DNA sequencing. 

[0094] Based on the principle of PSMEA, primers can be designed to be either universal 
for both wild type and mutants or type-specific, for example, specific to either wild type or mutants. 
In addition, many options can be explored with different sets of dNTPs and locations of primer 
binding. To test the feasibility of PSMEA for HIV-1 genotyping, the three most common and 
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important mutation sites of codons 82 (V82), 63 (L63) and 90 (L90) were detected by PSMEA and 
direct DNA sequencing. FIGS. 5A and 5B show an example to indicate the primer design and 
expected base extensions using different sets of dNTPs for detecting mutation of V82A with a 
universal primer. In FIGS. 5 A and 5B, each combination of dNTPs results in a different extension 
profile, providing a fingerprint for wild type and mutant. In FIG. 5A, extension profiles for wild 
type and mutant are illustrated. In FIG. 5B, the peak profiles represent the number of bases 
extended as detected by automated DNA sequencing using fragment length polymorphism analysis 
software, with the first peak from left representing unextended primer. When using specific sets of 
dNTPs, ATC, ATG, ACG and TCG, the primer will extend 3, 1, 0 and 2 bases with wild type. In 
contrast, extensions of 0, 1, 5, and 2 bases will occur with mutant. These results clearly indicate that 
the predominant population in this sample was V82A mutant, showing a pattern of 0, 1, 5 and 2 
bases extended. However, in the presence of ATC, about 12% of the primer extended, indicating a 
low proportion of the wild type virus still remained in this patient. Apparently only two sets of 
dNTPs (ATC and TCG) were needed to determine the levels of mutant and wild type in a mix. In 
some cases, V82 could mutate to V82I instead of V82A. A primer was designed for detection of 
V82I mutation using a set of dNTPs (A, T, and C). 

[0095] It is more difficult to detect drug-resistant mutations at codon 63 than other sites 
because in this case there are 6 combinations of codons for Leucine (L63) and 4 codons for Proline 
(P63). Since a wild type/mutant mix would almost certainly be caused by a single base substitution, 
only four variations should be expected, with only CTC/CCC (L63) and CTT/CCT combinations 
seen so far, with the former combination being the most common. To overcome this problem, a new 
type of primer was designed for PSMEA to detect different drug-resistant mutations of L63 codon 
as follows: the target mutation point was included as part of the 3' end of two primers. The 3' end of 
one primer (PR063L(G)-R) was a perfect match to the wild type, but not mutant (FIGS. 6A(1) and 
6A(2)), while the 3' end of the second primer (PR063P(G)-R) was a perfect match to the nucleotide 
sequence of mutant, but not wild type (FIGS. 6A(3) and 6A(4)). In FIG. 6A, the matched primers 
should have 4 and 14 bases extension on both L63 and L63P ((1) and (3)), and the primers with 
mispairs on L63P and L63 were expected to have no extension (FIG. 6A, parts (2) and (4)). In FIG. 
6B, the primer extended either 4 and 14 bases (second and last peaks from the left) FIG. 6 A, parts 
(1) and (3), or virtually not at all (FIG. 6A, parts (2) and (4)). 
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[0096] If a mismatch occurs on the 3' end of the primer, the pfu will remove the 
mismatched nucleotides. A mix of 3 dNTP's was added to the reaction in which there are no dNTPs 
to correct the mismatches on the 3' end of the primer. Thus, primer extension occurs under these 
circumstances only if the 3' end of the primer was a perfect match to the target mutation point. This 
results in very little, if any, nonspecific extension, even at annealing temperatures as low as 55°C. 
When dNTPs (A, T and C) were added, the matched primer was extended with 5 and 14 bases 
extensions (FIG. 6A, parts (1), (2), and (3) and FIG. 6B, parts (1) and (3)). However the 
mismatched primer could not be extended because the mismatched nucleotides at 3' end of the 
primer were removed with no dGTP to repair the mismatch (FIG. 6A, parts (2)and (4) and FIG. 6B, 
parts (2) and (4)). 

[0097] Unlike protease codon 63 which has multiple combinations for L63 mutating to 
L63P, and codon 82 with V82 mutating to V82I/A/others, codon 90 has only one possible 
combination for L90 mutating to L90M (AAG to ATG). As a result, only one pair of primers is 
required for detection of this mutation point. FIGS. 7A and 7B show that the primer (PRO90L-R) 
was designed in a reverse direction for detection of L90. FIG. 7 A illustrates the expected number of 
bases extended with matched and mismatched primers on wild type and mutant, respectively (1-4). 
In FIG. 7B, primers PRO90L-R and PRO90M-F on wild type were extended with 4 on wild type (1) 
and 8 bases on mutant (3). 

[0098] As expected, the primer was extended with four bases (FIG. 7A, part (1) and FIG/ 
7B, part (1)). Under the same reaction conditions, the primer was not extended on mutant L90M due 
to the T:T mismatch at the 3' end of the primer (FIG. 7A, part (2) and FIG. 7B, part (2)). Another 
primer (PRO90M-F) was designed to detect low levels of drug-resistant mutant L90M. The primer 
was extended on L90M mutant with eight bases (FIG. 7A, part (3) and FIG. 7B, part (3)), but not on 
wild type due to the second to last nucleotide mismatch (A: A) at the 3' end of the primer (FIG. 7 A, 
part (4) and FIG. 7B, part (4)). 

[0099] In order to assess the sensitivity of PSMEA for detecting low levels of mutants in 
a viral population, an artificial mixing experiment was carried out. PCR products from the protease 
region of V82 wild type and the V82I mutant were mixed in the following proportions between 
mutant and wild type: 3.1% (1/32), 1.6% (1/64), 0.8% (1/128) and 0.0% (for example, 100% wild 
type). FIG. 8 shows that 0.8% mutant in the mix was clearly detected with PSMEA. The levels at 
0.8%-3.1% were not detected by direct DNA sequencing. It has previously been reported that the 
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mixed genotypes or quasispecies could be detected only when mixed genotype(s) reached levels 
over 25% (Deeks SG, Abrams DL, Lancet, 349: 1489-1490, 1997). 

[0100] To confirm the sensitivity of PSMEA, a total of 32 isolates from patients being 
treated with antiviral drugs have been analyzed with both direct DNA sequencing and PSMEA for 
the presence of drug-resistant mutations in codon V82, L63 and L90 from the protease region of 
HIV-1. Table 4 shows a 100% concordance of the genotypic testing results between PSMEA and 
direct DNA sequencing for detection of predominant population of variants including mutants and 
wild type. 

[0101] Twenty-five (25) (78.1%) of 32 samples determined as being folly wild type 
(V82) were found to contain either mutants V82I or V82A at various levels (1-19%). Different 
levels (3-27%) of L63P mutant were detected by PSMEA in 100% (5/5) of the samples that were 
determined as purely wild type (L63) by direct DNA sequencing. However, in detecting L90M 
mutation, the results showed 86.7 % (13/15) concordance between the two methods. 

[0102] One of the samples (AR283) was analysed using population based sequencing, 
direct DNA sequencing and PSMEA to further evaluate the accuracy of PSMEA in detecting low 
levels of drug-resistant mutants. Table 5 shows that PSMEA demonstrated greater concordance with 
population based sequencing than direct DNA sequencing for monitoring minor variants. Seventy- 
six percent (76%) and twenty percent 20% of isolate from AR283 was found to be V82I mutant and 
wild type V82, respectively, by PSMEA, but 100% mutant V82I by direct DNA sequencing. 
Population based sequencing showed that wild type truly existed at a level of 12.5% (4/32). For 
detection of predominant populations variants (mutant V82I and wild type L90) in the isolate, both 
direct DNA sequencing and PSMEA were 100% concordant with population based sequencing. 

[0103] From the above, it is thus apparent that PSMEA is more sensitive than direct DNA 
sequencing for detection of low levels of drug-resistant mutants. The high degree of sensitivity of 
PSMEA should allow one to more precisely understand how early drug resistant mutations can 
develop, especially during antiretroviral therapy. For clinics, this may further aid in guiding 
clinicians in modulating therapeutic regimens before the appearance of phenotypic drug resistance 
or deterioration in clinical status. With early warning of drug specific mutations, agents can be 
chosen to impede resistance and minimize cross-resistance, yet still have an antiviral effect on the 
patient's dominant quasispecies. This, in turn, will hopefully delay therapeutic failure and increase 
survival. 
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SEQUENCE LISTING 

<110>Hu,Yu-Wen 

<120> PRIMER-SPECIFIC AND MISPAIR EXTENSION ASSAY FOR IDENTIFYING GENE 
VARIATION 
<130> 2883-4757US 
<160> 49 

<170> Patentln version 3.0 
<210> 1 
<211> 25 
<212> DNA 
<213> Artificial 

<220x223> sense universal primer for PCR (first round) 
<400> 1 

ctcccctgtg aggaactact gtctt 25 

<210> 2 
<211> 20 
<212> DNA 
<213> Artificial 

<220x223> antisense universal primer for PCR (first round) 
<400> 2 

ctcgcaagca ccctatcagg 20 

<210> 3 
<211> 31 
<212> DNA 
<213> Artificial 

l<220x223> sense universal primer for PCR (second round) 
<400> 3 

ccatagtggt ctgcggaacc ggtgagtaca c 31 

<210> 4 
<211> 18 
<212> DNA 
<213> Artificial 

<220x223> antisense universal primer for PCR (second round) 



<400> 4 

cccaacacta ctcggcta 

<210> 5 
<211> 21 
<212> DNA 
<213> Artificial 

l<220x223> primer for PSMEA 
<400> 5 

ctcaatgcct ggagatttgg g 

<210> 6 

<211> 20 

<212> DNA 

<213> Artificial 

<220x223> primer for PSMEA 

<400> 6 

caccggaatt gccaggacga 

<210> 7 

<211> 21 

<212> DNA 

<213> Artificial 

<220x223> primer for PSMEA 

<400> 7 

acactactcg gctagcagtc t 

<210> 8 

<211> 21 

<212> DNA 

<213> Artificial 

<220x223> primer for PSMEA 

<400> 8 

ccactctatg cccggtcatt t 



<210> 9 

<211> 21 

<212> DNA 

<213> Artificial 

<220x223> primer for PSMEA 

<400> 9 

tatgtccggt catttgggca c 

<210> 10 

<211> 21 

<212> DNA 

<213> Artificial 

<220x223> primer for PSMEA 

<400> 10 

cactctgtgc ccggccattt g 

<210> 11 

<211> 19 

<212> DNA 

<213> Artificial 

<220x223> primer for PSMEA 

<400> 11 

accggaatcg ctggggtga 

<210> 12 
<211> 19 
<212> DNA 
<213> Artificial 

K220x223> primer for PSMEA 
<400> 12 

accggaatcg ccgggatga 

<210> 13 
<211> 21 
<212> DNA 



29 



<213> Artificial 

<220x223> primer for PSMEA 

<400> 13 

cactactcgg ctagtgatct c 21 



<210> 14 
<211> 19 
<212> DNA 
<213> Artificial 

K220x223> primer for PSMEA 
<400> 14 

gggggtcctg gaggctgtt 



<210> 15 

<211> 22 

<212> DNA 

<213> Artificial 

<220x223> primer for PSMEA 

<400> 15 

cattgagcgg gtttgatcca at 22 



<210> 16 
<211> 12 
<212> DNA 
<213> Artificial 

<220x223> Extension product of wild-type HIV-1 
<400> 16 

agtccacatc ct 12 



<210> 17 
<211> 12 
<212> DNA 
<213> Artificial 

<220x223> Extension product of wild-type HIV-1 



<400> 17 
cgtccacatc ct 
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<210> 18 
<211> 41 
<212> DNA 
<213> Artificial 
<220> 

<223> L63 + primer sequences 
<220> 

<221> misc feature 

<222> (31)..(41) 

<223> n can be any nucleotide 

<400> 18 

gctttatgtc cacagatttc tatgagtacc tnatcatact n 



<210> 19 
<211> 44 
<212> DNA 
<213> Artificial 

<220x223> L63P + primer sequences 
<400> 19 

agacagtatg atcaggtact catagaaatc tgtggacata aagc 

<210> 20 
<211> 26 
<212> DNA 
<213> Artificial 
<220x223> pro63L(G)-R 

<400> 20 

gctttatgtc cacagatttc tatgag 

<210> 21 
<211> 44 
<212> DNA 
<213> Artificial 



<220x223> L63P + primer sequences 
<400> 21 

agacagtatg atcaggtacc catagaaatc tgtggacata aagc 



<210> 22 
<211> 41 
<212> DNA 
<213> Artificial 

<220x223> L63 + primer sequences 

<220x221> misc_feature 

<222> (32)..(41) 

<223> n can be any nucleotide 

<400> 22 

gctttatgtc cacagatttc tatgggtacc tnatcatact n 



<210> 23 
<211> 26 
<212> DNA 
<213> Artificial 
<220x223> PRO90L-R 

<400> 23 

gctttatgtc cacagatttc tatggg 



<210> 24 
<211> 33 
<212> DNA 
<213> Artificial 
<220x223> L90 

<220x221> misc_feature 

<222> (32)..(33) 

<223> n can be any nucleotide 

<400> 24 

atttaaagtg caaccaatct gagtcttcag arm 



<210> 25 
<211> 33 
<212> DNA 
<213> Artificial 
<220x223> L90 

<400> 25 

aatctgaaga ctcagattgg ttgcacttta aat 



<210> 26 
<211> 27 
<212> DNA 
<213> Artificial 
<220x223> PRO90L-R 

<400> 26 

atttaaagtg caaccaatct gagtctt 

<210> 27 
<211> 33 
<212> DNA 
<213> Artificial 
<220x223> L90M 

<400> 27 

aatctgatga ctcagattgg ttgcacttta aat 



<210> 28 
<211> 38 
<212> DNA 
<213> Artificial 
<220x223> PRO90M-F 

<220x221> misc_feature 

<222> (37)..(38) 

<223> n can be any nucleotide 

<400> 28 

ctgtcaacat aattggaaga aatctgatga cgcagann 



<210> 29 
<211> 38 
<212> DNA 
<213> Artificial 
<220x223> L90M 

<400> 29 

aatctgcgtc atcagatttc ttccaattat gttgacag 

<210> 30 
<211> 28 
<212> DNA 
<213> Artificial 
<220x223> PRO90M-F 

<400> 30 

ctgtcaacat aattggaaga aatctgat 

<210> 31 
<211> 38 
<212> DNA 
<213> Artificial 
<220x223> L90 

<400> 31 

aatctgagtc aacagatttc ttccaattat gttgacag 

<210> 32 
<211> 13 
<212> DNA 
<213> Artificial 

<220x223> Extension product of a mutant HIV-1 

<220x221> misc_feature 

<222> (12)..(13) 

<223> n can be any nucleotide 



<400> 32 
agttgtatta ann 



13 
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<210> 33 
<211> 18 
<212> DNA 
<213> Artificial 

<220x223> Extesion product of a wild HF/-1 
<400> 33 

tcaacataat tggaaaga 18 



<210> 34 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220x221> miscfeature 

<223> nucleotide sequence on templates for HCV genotype la 
<400> 34 

tttgggcgtg cccccgcaag ac 22 



<210> 35 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc feature 

<223> nucleotide sequence on templates for HCV genotype lb 
<400> 35 

tttgggcgtg cccccgcgag ac 22 



<210> 36 

<211> 18 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<222> (16)..(18) 

<223> N can be any nucleotide 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype la 
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<400> 36 

cgcgggggcg cgcccnnn 1 8 

<210> 37 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype la 
<400> 37 

gtcttgcggg ggcacgccca aa 22 

<210> 38 

<211> 13 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype la 
<220x221> misc_feature<222> (12)..(13)<223> n can be any nucleotide 
<400> 38 

cgcgcccccg cnn 13 



<210> 39 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc feature 

<223> nucleotide sequence on templates for HCV genotype lb 
<400> 39 

gtctcgcggg ggcacgccca aa 22 

<210> 40 

<211> 15 

<212> DNA 

<213> Hepatitis C virus 
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<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype lb 

<220x221> misc feature 

<222> (13)..(15)<223> n can be any nucleotide 

<400> 40 

cgcgcccccg cgnnn 15 



<210> 41 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype la 
<400> 41 

tttgggcgtg cccccgcaag ac 22 



<210> 42 

<211> 18 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype la 
<400> 42 

cgcgggggcg cgcccaaa 18 



<210> 43 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype la 

<220x221> misc_feature 

<222> (17)..(22) 

<223> n can be any nucleotide 
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<400> 43 

aggaccgggt cctttcnntt gg 22 



<210> 44 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype 6a 
<400> 44 

aggaccgggt cctttccatt gg 22 



<210> 45 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype 6a 
<400> 45 

aggaccgggt cctttccatt gg 22 



<210> 46 

<211> 15 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype 6a 

<220x221> miscjeature 

<222> (15)..(15) 

<223> n can be any nucleotide 

<400> 46 

ggaaagggac ccggn 15 



<210> 47 
<211> 22 
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<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype la 

<220x221> miscjeature 

<222> (17)..(22) 

<223> n can be any nucleotide 

<400> 47 

aggaccgggt cctttcnntt gg 22 

<210> 48 

<211> 48 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype 2a 
<400> 48 

tggccgggca tagagtgggt ttatccaaga aaggacccag tcttcccg 48 



<210> 49 

<211> 48 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype lb 



<400> 49 

tctccaggca ttgagcgggt tgatccaaga aaggacccgg tcgtcctg 
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Table 3 

The prevalence of HCV mixed genotype infections in different populations determined by PSMEA 
and direct DNA sequencing 





PSMEA 


Direct DNA 
sequencing 


Blood donors 


7/72 


ND 




(9.7%) 




Patients with chronic hepatitis C 


15/80 


8/80 




(18.8%) 


(10.0%) # 


Thalassemia patients 


6/36 


4/36 




(16.7%) 


(11.1%) 



# The samples determined to contain more than one genotype by direct DNA sequencing were 
100% concordant with that determined by PSMEA. Since the direct DNA sequencing method is not 
sensitive enough to detect low levels (<25%) of mixed genotype infections, the percentage of mixed 
genotype infections determined by direct DNA sequencing is lower than what was found by 
PSMEA. 
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Table 4 



Comparison of the sensitivity between direct DNA sequencing and PSMEA for detecting low levels 





Direct DNA Sequencing 


PSMEA 




L63/P/ 


V82/I/A L90/M 


L63 


L63P 


V82/V82I/V82 


L90/L90 


Sample 


ULOj 1 


(yv/ivij 


y W/1VLJ 


(%\ 


/L63 


A 






M 




ID 


\ W / IVl ) 








r 


(%) 
\ /0 ) 






V /u / 




4B 


T f\1 


Voz 


T Qfi 






Q1 


7 


2 






6B 
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97 
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7 


93 


JB 


T A^P 


\/Q7 A / 

V 


T QOM/ 
T 






31 
J X 


1S 


54 


56 


44 


MN 




\/R7 A 
V oZ/V 


T QOM 


0 


100P 


22 


10 


68 


5 
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T £^P 


V oZ 
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o 
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Q 
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7 


100 


0 


DS1 


T £^P 


V OZ 
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8fi 


12 


2 






DS2 
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\/Q7 
V oZ 
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-J 


97P 

/ X 


95 

y-j 




7 






GP 


T A7P 


VoZ 


t on 






Q1 


1 


\> 
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0 


AS 




V oZV\ 








14 

X*T 


4 


82 






FP 


T A^P 


VoZ 
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QO 

y\j 


7 




100 

J. \J\J 


o 


M15-8 




VoZ 


T QH 


4Q 


sir 


yj 










M15-10 
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t on 


/ J 


?7r 


y\j 


u 


4 






M17-17 




V oZ V/l 


t on 


SO 


soc 


71 


27 


2 






Ml 7-22 
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T OH 


4R 

*tO 


52c 

~jz. 


QO 
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C 
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2Q 
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M17-41 


T 


V82 


L90 


Q7 




93 


4 


3 






Ml 7-44 


L63P 


V82 


L90M 






93 


5 


2 






98027 


L63 


V82 


L90 


96 


4P 


89 


9 


2 


99 


1 


V894713 


L63 


V82 


L90 






89 


9 


2 


100 


0 


AR283 


L63P 


V82I 


L90 






20 


76 


4 


100 


0 


AR929 


L63P 


V82I/V 


L90 






33 


65 


2 


100 


0 


AR2429 


L63P 


V82V/I 


L90 






39 


60 


1 


100 


0 


AP6423 


L63L/P 


V82 


L90 






90 


8 


2 


100 


0 


AP6617 


L63P 


V82 


L90 






89 


10 


1 


100 


0 


AP7010 


L63P 


V82 


L90 






90 


9 


r 


100 


0 


AP7366 


L63P 


V82 


L90 






90 


8 


2 


100 


0 



The percentage of mutant in a mix is calculated as follows: 
% mutant = (% wild type extension + % mutant extension) x 



100 



♦ # 



Table 5 

Evaluation of the accuracy of PSMEA and direct sequencing for detection of low levels of variants 
with population based sequencing 



Method 



Population based 
sequencing 

Direct DNA 
sequencing 

PSMEA 



Genotypes 
%V821 %V82 
87.5(28/32)* 12.5 (4/32) 



100.0 



76.0 



0.0 



20.0 



%L90M 



0.0 



0.0 



%L90 



0.0(0/32) 100.0(32/32) 



100.0 



100.0 



* Denotes that a total number of mutant or wild type found in 32 cloned PCR products sequenced. 



